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Large deviation approach to nonequilibrium systems 

Hugo Touchette and Rosemary J. Harris 



Abstract. The theory of large deviations has been applied successfully in the last 30 years or so to 
study the properties of equilibrium systems and to put the foundations of equilibrium statistical mech- 
anics on a clearer and more rigorous footing. A similar approach has been followed more recently 
for nonequilibrium systems, especially in the context of interacting particle systems. We review here 
the basis of this approach, emphasizing the similarities and differences that exist between the applic- 
ation of large deviation theory for studying equilibrium systems on the one hand and nonequilibrium 
systems on the other. Of particular importance are the notions of macroscopic, hydrodynamic, and 
long-time limits, which are analogues of the equilibrium thermodynamic limit, and the notion of stat- 
istical ensembles which can be generalized to nonequilibrium systems. For the purpose of illustrating 
our discussion, we focus on applications to Markov processes, in particular to simple random walks. 



1.1 

Introduction 

Nonequilibrium systems are being increasingly studied using methods borrowed from 
the mathematical theory of large deviations, as developed in the 60s and 70s by Don- 
sker and Varadhan, and Freidlin and Wentzell (see (llUU for historical references). 
Indeed the central concepts and quantities of this theory - e.g., the large deviation 
principle, rate functions, generating functions, etc. - have now entered the standard 
jargon of driven nonequilibrium systems modelled as discrete- or continuous-time 
Markov processes (see, e.g., d,0,[l]). 

With hindsight, one can argue that this evolution, although relatively recent, was 
to be expected: large deviation theory has been used successfully in equilibrium stat- 
istical mechanics for well over 30 years and so it is not surprising that 
this success finds its way into nonequilibrium statistical mechanics. However, there 
is more, in that the two scenarios - equilibrium and nonequilibrium - share many 
ideas, concepts and even a theoretical structure which happen to find a clear and pre- 
cise expression in the language of large deviations. It is natural, therefore, to see this 
language being used for both theories. 

By viewing equilibrium statistical mechanics from the point of view of large devi- 
ation theory, one gets a clear sense, for example, of why there is a Legendre transform 
in thermodynamics connecting the entropy and the free energy, when this Legendre 
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transform is valid, how equilibrium states relate to the notions of concentration and 
typicality, and how these states arise out of variational principles such as the max- 
imum entropy principle or the minimum free energy principle. Similar ideas and 
results of nonequilibrium statistical mechanics are also made clear by viewing them 
through the prism of large deviation theory. In addition, one sees, as mentioned, 
essential similarities between equilibrium and nonequilibrium. 

Our goal in this chapter is to explain these points and to illustrate them with simple 
examples, mainly involving continuous-time Markov processes. We start in the next 
section by recalling the basis and essential concepts of equilibrium statistical mech- 
anics, and by discussing their analogues in nonequilibrium statistical mechanics. 
Among these, we mention the notions of statistical ensemble, stationarity, typical- 
ity, fluctuations, and scaling limit (e.g., thermodynamic limit, hydrodynamic or mac- 
roscopic limit, long-time limit). In Sec. 11.31 we re-express these concepts in the 
language of large deviations to define them in a precise, mathematical way and to 
emphasize the theoretical structure that underlies both equilibrium and nonequilib- 
rium statistical mechanics. We illustrate this structure with a variety of applications in 
Sec. ll.4l and then end in Sec. ll.5l with some concluding remarks and open problems. 

Much of the large deviation content explored in the present contribution can be 
found with more details in 0]. Here we focus on discussing the common goals, con- 
cepts, and results of equilibrium and nonequilibrium statistical mechanics, rather than 
providing a complete review of either subject, and on proposing a clear approach to 
studying nonequilibrium systems which parallels that used for studyi ng equi librium 
systems. We draw inspiration in doin g so from the works of Oono I9L I 1CX 1 1 Ifl . Eyink 
CI, El, 03], and Maes et al. QEEUSH], among others, which show the emer- 
gence of similar ideas and views as early as the late 80s. 



1.2 

From equilibrium to nonequilibrium systems 

Before we discuss how large deviation concepts enter in equilibrium and nonequilib- 
rium statistical mechanics, we recall in this section the basis of each theory and em- 
phasize some concepts shared by both. Of these, the most important to keep in mind 
is the concept of typicality, connected mathematically to the Law of Large Numbers 
and the concentration of probability distributions. 

1.2.1 

Equilibrium systems 

The goal of equilibrium statistical mechanics, as is well known, is to explain and 
predict the emergence of macroscopic equilibrium states of systems composed of 
many particles by treating their microscopic states in a probabilistic way. The main 
properties of equilibrium states are that they are stationary in time, they are stable 
against small perturbations, and are described by only a few variables, i.e., they are 
low-dimensional macrostates compared to the high-dimensional mfcrastates used for 
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describing the many-particle system at the microscopic level. 

Equilibrium states are also defined with respect to a given macrostate. Physicists 
often say that a system is at equilibrium or is in a state of equilibrium, but what is 
meant, to be more precise, is that the system has, say, an equilibrium energy or an 
equilibrium magnetization. Thus an equilibrium state is a particular state or a value 
of a macrostate or a collection of macrostates. This is different from saying that a 
system is an equilibrium system. We shall give a definition of the latter concept later, 
after describing the analogue of an equilibrium state for nonequilibrium systems. 

For now, let us recall how equilibrium states are modelled in statistical mechanics. 
The basic ingredients are well known. Consider a system of N particles, which for 
simplicity we take to be a classical system, and let the sequence ui = (uix, U2 , ■ ■ ■ , wjv) 
denote the microscopic configuration or microstate of the system, where u>i is the 
state of the zth particle. To study the statistical properties of this system, we consider 
a prior probability distribution P(u), interpreted as the stationary distribution of the 
microscopic dynamics. The state space of one particle is denoted by A, so that P(uj) 
is a probability distribution over the N -particle space Ajy = A N . 

Next, we consider a macrostate Mjv, corresponding mathematically to a function 
Mn(lu) of the microstates, and proceed to compute the probability distribution of 
this random variable obtained under P(lj) using 

P{M N = m)= / S(M N (u) -m)P(u)du. (1.1) 

JAn 

If P(oj) is a valid model of an equilibrium system and the macrostate Mjv is chosen 
properly, then what should be observed is that P(Mjy) is concentrated around certain 
highly probable values and that this concentration gets more pronounced as TV gets 
larger. It is these most probable or typical values (points or states) of Mjq that we 
call equilibrium states of . 

Mathematically, the concentration of P{M^) is akin to a Law of Large Numbers, 
in the sense that there exist sets B of values of Mjv such that 

lim P(M N £fi) = l and lim P(M N £B) = Q. (1.2) 

TV— >oo TV— >oo 

The smallest set B having this property corresponds to the set of equilibrium values 
of Mjv or, more loosely, the set of equilibrium states of the system (as defined with 
respect to Mat). 

We shall see in the next section that an essential property of the concentration of 
P(Mjv) on B is that it is exponential as a function of N (or, more generally, the 
volume of the system), which means that the probability that Mjv deviates from one 
of its equilibrium values is exponentially small with the system size, N. Physically, 
these deviations are termed fluctuations, and so we say that the probability of fluctu- 
ations from equilibrium states is exponentially small with N. 

The exponential concentration of P(Mjv) explains why large deviation theory is 
used in equilibrium statistical mechanics. Physically, it is also the reason why equilib- 
rium states correspond to typical values of Mpj and not, as often claimed, to average 
values of Mjy. The fundamental property of equilibrium states is indeed that they 
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do not fluctuate, or at least appear not to at the macroscopic level, so the fact that 
Mjv has a well-defined average value is obviously not enough: Mjv must converge 
in probability to some typical values. 

The reason why average values are used in statistical mechanics is arguably that 
they are conceptually simpler than typical (or concentration) values and that, if M n 
has a unique typical value, then its average is the same as its typical value (we say 
in this case that Mjv is self-averaging). However, the use of averages is somewhat 
misleading as it detracts us from the essential property of equilibrium states, which is 
again that these states arise probabilistically from the concentration of a probability 
distribution. Equilibrium states are first and foremost typical states arising from the 
scaling limit that is the thermodynamic limit Jl9ll2fj|| . 

This is an important point which leads us to the discussion of extensivity versus 
intensivity. Physically, it should be clear that the total energy Hjy of an TV-body sys- 
tem with short-range interactions does not concentrate in the thermodynamic limit, 
simply because the energy is extensive for such a system, and so diverges with TV. 
As a result, one cannot formally say that the system has an equilibrium energy in the 
thermodynamic limit. Rather, the correct macrostate having an equilibrium value, 
i.e., the one that concentrates in the thermodynamic limit, is the energy per particle 
or mean energy = Hjy/N which is an intensive quantity. 

To explain this point more clearly, let us consider a sum 



of TV independent and identically distributed random variables X\,..., Xjj. If the 
mean {Xi) — /1 of these random variables is finite, then as TV — > 00 the distribution 
of Sn/N concentrates to the mean in such a way that 



for all e > 0, in accordance with the Law of Large Numbers. The point to note about 
this result, which is equivalent to the second limit shown in dl.21 ). is that it holds for 
the mean sum Sat/TV and not for the sum Sjy: the distribution of P(Sjy) does not 
concentrate; in fact, it flattens as TV — > 00. Similarly, it is easy to show that the distri- 
bution of 5tv/TV" flattens for all a £ (0, 1) and concentrates in a trivial way at zero 
for all a > 1. Hence, the only normalization of a sum Sjv of independent (or near in- 
dependent) random variables with finite mean that yields a non-trivial concentration 
point is Sff /TV. 

The same observation applies to equilibrium states. The reason again why equi- 
librium states appear stable at the macroscopic level is that the fluctuations around 
these states are very improbable and become the more unlikely the bigger the sys- 
tem gets. Mathematically, the way to make sense of this observation is to consider 
random variables that have the property of concentrating in the thermodynamic limit 
TV — > 00. From this point of view, the total energy Hjy is not a "good" macrostate 
to consider because it does not concentrate in this limit. The same goes, similarly 



N 




(1.3) 



lim P(\S N /N-fi\ > e) =0 



(1.4) 



Hugo Touchette and Rosemary J. Harris: Large deviation approach to nonequilibrium systems — 
Chap. 1 — 2012/4/25 — 0:18 — page 5 



to Sn, for the macrostates Hpf/y/~N or Hjy/N 2 : the distribution of the former flat- 
tens, whereas the distribution of the latter concentrates trivially to zero. We get a 
non-trivial concentration only for Hpf /N. 

This at least is generally true for short-range interacting systems. For long-range 
interacting systems, such as gravitational systems or mean-field systems, the "good" 
energy macrostate to consider might be /N 2 or, more generally, Hjq /N a with 
a > 2 {2lll . The choice of a will depend on the system considered, but the require- 
ment again is that the distribution of the macrostate that is studied should concen- 
trate when N — > 00. From this point of view, one might have to consider different 
thermodynamic (or scaling) limits and macrostates in order to correctly describe the 
equilibrium states of those systems. 

1.2.2 

Nonequilibrium systems 

The study of nonequilibrium systems is conceptually more difficult than that of equi- 
librium systems because one is interested in describing not only the stationary fluc- 
tuations, as is done for equilibrium systems, but also the fluctuating dynamics of the 
system arising in time and the fluctuations of macrostates or observables integrated 
over time. Thus, in addition to considering the number of particles present in a sys- 
tem (or its volume), one also needs to consider the evolution of that system in time. 
This implies that different scaling limits may be taken depending on the system and 
macrostate or observable studied. 

For definiteness, we consider here nonequilibrium systems modelled by Markovian 
processes. To simplify the presentation of these models, we assume for now that the 
stochastic evolution takes place in discrete time (although continuous-time models 
will also be discussed in the following sections). In this case, the microstate u that 
represented before the configuration of an equilibrium system at a fixed (yet unspe- 
cified) instant of time is now a complete trajectory uj — {uJi}^ = i consisting of n 
timesteps. The assumption that the process is Markovian then amounts to assuming 
that the prior distribution P(ui) can be decomposed according to a Markov chain 

P(w) = P(wi)P(W2|o;i) . . . P(uj n \Un-l), (1.5) 

with initial distribution P{uj\) and transition matrix elements P(wj|wj_i), which, 
in most cases, are assumed to be time-homogeneous (i.e., time-independent). This 
form of prior is, from a pragmatic point of view, our stochastic model for uj from 
which all distributions of macrostates or observables are computed. Thus, so far, the 
formalism is abstractly the same as for equilibrium systems: a system is described by 
its microstate uj and a probability distribution P(w) on the space of microstates. What 
changes for nonequilibrium systems is the interpretation of uj as a time-trajectory of 
a system of one or more particles. 

This difference allows us to consider many types of macrostates or observables. 
For example, one can consider a fixed-time or static observable M(wj) which is a 
function of the state of the system at a specific timestep i. One can also consider 
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dynamic observables of the form 



n z — ' 



(1.6) 



i=i 



referred to mathematically as additive observables or additive Junctionals [3[], which 
involve states at different times. Another type of dynamic observable, which arises 
in the context of particle currents, is 



Whatever the observable chosen, the goal when studying nonequilibrium systems 
is to compute the probability distribution P(M = m) of a given observable M start- 
ing from the prior distribution P(w) defining our model of that system, and to see 
whether this distribution concentrates, in some scaling limit, over specific values of 
M. The scaling limit that needs to be considered depends on the system and ob- 
servable chosen: it can be the infinite-volume limit N — > oo for a fixed-time ob- 
servable M(ui), the infinite-time limit n — > oo for additive or current-like observ- 
ables, or a combination of these two limits if the latter observables involve many 
particles. Other limits can also be conceived, e.g., the small-noise limit of dynamical 
systems perturbed by noise, the continuous-time limit of discrete-time systems, or 
the continuous-space limit of discrete-space systems (J. 

Some of these limits will be explored in the following sections. The essential point 
to note is that these scaling or hydrodynamic limits are expected to give rise to a 
concentration of the probability distribution P(M) similar to the one discussed for 
equilibrium systems, and thus to the emergence of typical states for the system or 
observable studied. 



Equilibrium versus nonequilibrium systems 

So far we have not attempted to distinguish equilibrium from nonequilibrium systems 
in any precise way other than to hint that the distribution P(ui) describes a "static" 
random variable in the case of equilibrium systems and a "dynamic" random variable, 
i.e., a stochastic process, in the case of nonequilibrium systems. But what makes a 
system an equilibrium or a nonequilibrium system? 

To answer this question in a mathematical way, we need to consider the stochastic 
time evolution of a system and study how the prior distribution P(uj) of its com- 
plete trajectory oj behaves when the time ordering of oj is reversed. To be more 
precise, consider the discrete-time trajectory uj = (cji,cj2, . . . , w„_i, uj n ) contain- 
ing n timesteps, and define the time-reversed trajectory lu r associated with cj as 
the trajectory obtained by re-ordering the states of ui in reverse order, i.e., uj r = 
(uj n ,u> n -l, . . . ,UJ2, wi). Then we say that the system modelled by P(to) is an equi- 




(1.7) 



i=l 



1.2.3 
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librium system if P(u) = P(uj r ) for all uj [4]0 If this condition is not satisfied, then 
we say that the system is a nonequilibrium system. Mathematically, this condition is 
equivalent to the notion of reversibility or detailed balance, stated here at the level 
of complete trajectories rather than the more usual level of transition rates. Thus, the 
stochastic dynamics of an equilibrium system satisfy detailed balance, whereas those 
of a nonequilibrium system do not. 

The rationale behind this definition is that equilibrium prior distributions, such 
as the microcanonical and canonical distributions, are stationary distributions of 
stochastic dynamics verifying detailed balance, and that the very notion of detailed 
balance captures the physical observation that equilibrium systems are systems in 
which fluctuations arise with no "preferred" direction in time. Nonequilibrium 
systems, by contrast, have a stochastic dynamics that is not symmetric under time 
reversal. This does not mean that nonequilibrium systems do not have stationary 
distributions; they often do, but the form of these distributions is generally much 
more complicated than equilibrium distributions. 

1.3 

Elements of large deviation theory 

We show in this section how the mathematical theory of large deviations makes pre- 
cise the observation that probability distributions of macrostates concentrate expo- 
nentially with some scaling parameter (e.g., number of particles, volume, integration 
time, noise power, etc.). This exponential concentration is the source, for equilib- 
rium systems, of the Legendre transform connecting the entropy and the free energy, 
and, therefore, of the Legendre structure of thermodynamics. For nonequilibrium 
systems, it also gives rise to a Legendre transform between quantities that are the 
nonequilibrium analogues of the entropy and the free energy. 

1.3.1 

General results 

To explain the central ideas and results of large deviation theory, we first consider a 
general random variable or macrostate A n indexed by the parameter n which can, for 
example, be the number of particles or the number of timesteps. 

The starting point of large deviation theory is the observation that the probability 
distribution P(A n ) of A n is, for many random variables of interest, decaying to zero 
exponentially fast with n. The exponential decay is in general not exact; rather, what 
often happens is that the dominant term in the expression of P(A n ) is a decaying 
exponential with n, so that we can write 

P(A n = a)^e- nI ^\ (1.8) 

with 7(a) the rate of decay. When P(A n ) has this form, we say that P(A n ) or A n 



1) For simplicity, we assume that the cj's themselves have even parity under time-reversal. 
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satisfies a large deviation principle (LDP). To be more precise, we say that P(A n ) or 
A n satisfies an LDP if the limit 

lim ln.P(A n = a)=I(a) (1.9) 

n— >oo Ti 

exists. The decay function 1(a) defined by this limit is called the rate function. The 
factor n in the exponential is called the speed of the LDP QBIS]. 

The interest in large deviations arises because many random variables and stochastic 
processes satisfy such a principle (although not all do; see, e.g., (3)]). The goal of 
large deviation theory, in this context, is to provide methods for proving that a given 
random variable or process satisfies an LDP and for obtaining the rate function 
controlling the rate of decay of the LDP. 

Among these methods, let us mention two that are especially useful. The first is 
known as the Gartner-Ellis Theorem |Q| and proceeds by calculating the following 
function: 

A(fc) = lim -\n(e nkA "), (1.10) 

n— >oo n 

known as the scaled cumulant generating function (SCGF). The statement of the 
Gartner-Ellis Theorem, in its simplified form, is that, if A(fc) is differentiable for all 
k £ R, then A„ satisfies an LDP with rate function given by the Legendre-Fenchel 
transform of X(k): 

1(a) = max{fca- A(Jfc)}. (1.11) 

In many physical applications, X(k) is actually differentiable and strictly convex, and 
in this case the Legendre-Fenchel transform reduces to the better-known Legendre 
transform, written as 

1(a) = k(a)a- \(k(a)), (1.12) 

with k(a) the unique solution of A'(fc) = a. 

The Gartner-Ellis Theorem is useful in practice because it bypasses the direct cal- 
culation of P(A n ). By calculating the SCGF of A n and by checking that this function 
is differentiable, we instantly prove that P(A n ) satisfies an LDP and obtain the rate 
function controlling the concentration of P(A n ) in the limit n — > 00. 

For certain random variables, A(fc) can be calculated but is not differentiable; see 
(il] for examples. In this case, it can be proved that the Legendre-Fenchel transform 
of \(k) yields only the convex envelope of 1(a) |3|]. To obtain the full rate function, 
one may use other methods, such as the contraction principle I2H3H8I I22I1 . The basis 
of this method is to express A n as a function f(B n ) of some random variable B n 
satisfying an LDP with rate function J(b), i.e., 

P(B n = b)^e- nJ{b) . (1.13) 

If such a random variable and function can be found, then the contraction principle 
states that A n also satisfies an LDP with rate function given by 

1(a) = min J(b). (1.14) 

6:/(6)=a 
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The minimization appearing in this result is a natural consequence of the approx- 
imation method known as Laplace's principle USB as applied to the integral 

P(A n = a) = [ P(B n = b)db. (1.15) 

Jb:f(b)=a 

Assuming that the probability distribution P(B n ) decays exponentially with n, this 
integral is dominated by the largest exponential term such that f(b) = a, which means 
that we can write 

P(A n = a) « exp ( -n min J(b) ) (1.16) 

V b:f(b)=a J 

with sub-exponential correction factors in n. Thus we see that P(A n ) satisfies an 
LDP with rate function given by Eq. d 1 . 1 4b . 

The name "contraction" arises in the context of this result from the fact that the 
function / can in general be a many-to-one function, in which case we are "contract- 
ing" the fluctuations of B n down to the fluctuations of A n in such a way that the 
probability of the fluctuation A n = a is the probability of the most probable (yet 
exponentially improbable) fluctuation B n = b leading to A n = a. 

This interplay between the appearance of exponentially small terms in integrals 
and the possibility to approximate these integrals by their largest term using Laplace's 
principle also explains the appearance of the "max" in the Gartner-Ellis Theorem and 
the Legendre transform connecting rate functions and SCGFs 0]. In this sense, large 
deviation theory can be thought of as a "calculus" of exponentially decaying probab- 
ility distributions, connecting the properties of integrals such as (e nkA " n ), which are 
exponential in n, with the exponential properties of P(A n ) itself. 



1.3.2 

Equilibrium large deviations 



The application of the results stated above to equilibrium systems is straightforward. 
For definiteness, we consider a general macrostate Mn(uj) involving N particles and 
study its probability distribution Pp(M]y) in the canonical ensemble defined by the 
prior probability distribution 

-f3H N (u>) r 

P {u)= e , Z N {p)= e- pB "^du, (1.17) 

Z n{P) J An 

where Hjy is the Hamiltonian of the system considered. 
If Pp{M N ) satisfies an LDP, then the limit 

lim -— \nP(M N = m) = I s (m) (1.18) 

N->oo N 

exists and defines the rate function Ip (m) of Mjv in the canonical ensemble at fixed 
inverse temperature /3. The SCGF associated with this rate function is 

Xp(k)= lim l]n(e NkMN ) p , (1.19) 
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where 

/ NkM N \ I NkM N (u)r> , \ , om 

(e )p= e Ny 'Pp((j)du>. (1.20) 
Ja n 

If Xg(k) is differentiable in k, then by the Gartner-Ellis Theorem we have that Ig{m) 
is the Legendre-Fenchel transform of Xg(k). 

The connection between the LDP of Mpj and its equilibrium values comes directly 
by writing the LDP informally in the form 

Pg(M N =m)^e- NI ^ m) . (1.21) 

Since rate functions are always positive, this result shows that Pg(Mj\r) decays ex- 
ponentially fast with TV except at points where Ig(m) vanishes. As noted before, 
these points must correspond to the points where Pg(M^) concentrates in the limit 
TV — > oo, and so to the equilibrium values of Mjy. Mathematically, we therefore 
define the set Eg of equilibrium values of Mjv in the canonical ensemble as the set 
of global minima and zeros of the rate function Ig(m): 

£g = {m: Ig(m) = 0}. (1.22) 

A similar definition can be given for the equilibrium values of Mjy in the microcanon- 
ical ensemble or any other ensemble by replacing PpioA with the prior probability 
distribution defining these ensembles; see Sec. 5.3 of |3fl. 

The rate function describes of course not only the equilibrium states but also the 
fluctuations around these states. In particular, if the rate function Ig(m) admits a 
Taylor expansion of the form 

Ig(m) = a(m - m* f + 0(\m - m*| 3 ) (1.23) 

around a given equilibrium value m* , then the small fluctuations of M pj around m* 
are Gaussian-distributed. The rate function, however, is rarely an exact parabola 
which means that the larger fluctuations of Mjy away from m* are in general not 
Gaussian-distributed. Their distribution is determined by the rate function Ig(m) 
and depends on the system studied. 

This explains the word "large" in large deviation: contrary to the Central Limit 
Theorem, which gives only information about the distribution of random variables 
around their mean, large deviation theory gives information about this distribution 
near the mean but also away from the mean - i.e., it gives information about both the 
small and the large fluctuations or deviations of random variables. From this point of 
view, large deviation theory can be thought of as generalizing both the Law of Large 
Numbers and the Central Limit Theorem. 

These observations are valid for any random variable. A more specific connection 
with equilibrium systems can be established by studying the large deviations of the 
mean energy hpj = Hpj/N with respect to the uniform distribution P(w) — 1/|Ajv|. 
In this case, the integral 

P{h N = u)=l 5(h N (u)-u)P(cj)du (1.24) 
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is, up to a multiplicative constant, the density of states giving the number of micro- 
states having a given mean energy hpf(co) = u. For most (if not all) equilibrium 
systems, the density of states is known to grow exponentially with N (or, more gen- 
erally, the volume) and this directly implies an LDP for P(/ijv = u), which we write 

as 

P(h N =u)^e Ns{u) . (1.25) 
In this form, it is clear that the function s(u) obtained with the limit 

s(u)= lim 1- In P(h N = u), (1.26) 

is the thermodynamic entropy associated with hpj. To be more precise, it is the en- 
tropy of /ijv, as usually calculated from the density of states, minus an unimportant 
additive constant; see Sec. 5.2 of Q]. 

To complete the connection with thermodynamics, note that the generating func- 
tion 



{e i^n N)= e NkhN ^> P{u)duj (1.27) 



JA N 

can be interpreted as the canonical partition function, whereas the SCGF of ftjv, 
defined as 

A(fc)= lim ±\n(e Nkh »), (1.28) 

iv— foo iv 

can be seen as the analogue of the free energy function of the canonical ensemble. 
To be more precise, re-define the partition function Zjv(/3) by including the prior 
uniform distribution P(u) in the integral over Ajy: 

ZnW)= ( e-P HN( - u) P{u)du) (1.29) 



and define the free energ^ by 



<p(fl = lim -iln^(/3). (1.30) 

Af->oo iv 

Then, it is easy to see that (p(/3) = — X(k) with k — —/3. From this connection, it is 
also easy to see that the Gartner-Ellis Theorem implies that, if ip(/3) is differentiable, 
then 

s(u) = mm{f3u-tp(l3)}. (1.31) 

This Legendre-Fenchel transform, with its inverse transform expressing ip(f3) in terms 
of s(u) (see Sec. 5.2 of 01), is the formal expression of the Legendre transform ap- 
pearing in thermodynamics. The large deviation derivation of this transform makes it 



2) In thermodynamics, the free energy is more commonly defined with an additional factor 1/0 in front 
of the logarithm. 
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clear that it is valid under specific mathematical conditions (viz., the differentiability 
of (p) and that it arises because of the exponential nature of both P{h^) and Z^(/3) 
and the Laplace's principle linking these two functions in the thermodynamic limit. 
In this sense, the Legendre transform of thermodynamics does not arise out of any 
physical requirement - it is a consequence of the large deviation structure of statistical 
mechanics, which appears in the thermodynamic limit. 



1.3.3 

Nonequilibrium large deviations 



Let us now consider a nonequilibrium macrostate or observable M n (uj) involving 
n timesteps of a Markov process cj = (u>i, U2, ■ ■ ■ , ^n) described by the transition 
matrix elements P(wj|a;j_i). The large deviation properties of M n can be studied 
similarly as done for equilibrium macrostates by calculating the SCGF A(fc) asso- 
ciated with M n and by obtaining the rate function of M n as the Legendre-Fenchel 
transform of X(k) provided that \(k) is differentiable. 

The Markov structure of the process underlying M n can be used here to obtain 
more explicit expressions for X(k). In the case of the additive observable shown in 
Eq. ill. 6b . for example, we have 

A(fc)=lnC(P fe ), (1.32) 

C(Pfc) being the largest eigenvalue of the so-called tilted transition matrix P k with 
elements 

i^k-l) = P(u i \u i - 1 )e kf( ' u *\ (1.33) 



For the current-like observable M n shown in Eq. dl.7t . we have the same result but 
with P fc now given by 

P k (ujM-i) = P(u i \u i -i)e kf( ' u *' u *-^. (1.34) 

These results are valid if the state-space of the Markov chain is bounded. For un- 
bounded state-spaces, A(fc) is not necessarily given by the logarithm of the dominant 
eigenvalue of P fc . We shall see a related example in Sec. 1 1.4. 41 

For Markov processes evolving continuously in time, the above statements translate 
into the following ones. For an additive functional of the form 

M T (a/) = iy f(uj t )dt, (1.35) 

the SCGF A(fc), calculated in the limit T — > oo, is given by the largest eigenvalue of 
the tilted generator, 

G k (u\ u) = G{u',u) + kf(u)5 u , iU , (1.36) 

with G(u , uj) the elements of the generator of the original process. Note the absence 
of the logarithm here, as we are dealing with the generator, not the transition matrix. 
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For current-like observables having the form 

N(T)-1 

M t (uj) = — /K."** +1 ) ( L3? ) 

i=0 

where the sum is over the random transitions between states happening at times 
{to, t\, . . . , iAr(T)-i}> M/c) is given instead by the largest eigenvalue of a tilted gen- 
erator with elements 

G fe <y ,u) = G(u' ,u)e kf{ - w ' w ' ] . (1.38) 

Observables involving other scaling limits, in addition to the time or particle num- 
ber limits, can be treated in a similar way, as discussed for example in Sec. 11.4.51 In 
all cases, the LDPs that we obtain give us information about the fluctuations of the 
observable of interest similar to the information obtained from equilibrium LDPs. 
In particular, the global minima and zeros of the rate function determine the typical 
values of that observable, which are physically interpreted as typical steady states 
or hydrodynamic states or equations, depending on the observable studied. Then, as 
for random variables in general, the shape of the rate function around its minima de- 
termines the behaviour of the small and large fluctuations of the observable around 
its typical values. 

With the knowledge of the rate function of an observable, it is possible for example 
to determine whether this observable satisfies the so-called fluctuation relation sym- 
metry. Consider, to be specific, an observable My integrated over the time T and 
assume that Mj- satisfies an LDP with rate function J(m). We say that Mj- satisfies 
a Gallavotti-Cohen-type fluctuation relation if 

P(M T = m) ^ eTc ™ (139) 



P(M T = -m) 

with c a positive constant. This means that the positive fluctuations of are ex- 
ponentially more probable than negative fluctuations of equal magnitude. In large 
deviation terms, it is easy to see that a sufficient condition for having this result is 
that I(m) satisfy the following symmetry relation: 

I{-m) - /(to) = cm. (1.40) 

In terms of the SCGF, we have equivalently 

\(k) = A(-fc-c). (1.41) 

The next section includes an example of a very simple Markov process having this 
fluctuation symmetry. For other more complicated examples, see, e.g., I3I I23I1 . 



1.4 

Applications to nonequilibrium systems 

In this section we aim to illustrate, more concretely, how the large deviation formal- 
ism of the preceding section can be applied to nonequilibrium systems as introduced 
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in Sec. 11.2.21 We concentrate on Markov processes in continuous time, providing a 
detailed pedagogical treatment for toy models of random walkers and indicating how 
the same techniques can be applied to more complicated (and hence more interesting) 
models of interacting particles. Among other sources we draw here on the compre- 
hensive review of Derrida Q] in which further details of many-particle applications 
can be found. 



1.4.1 

Random walkers in discrete and continuous time 



To fix ideas, let us start by analysing perhaps the simplest possible model - a ran- 
dom walker in discrete space. Specifically, we consider a particle moving on a one- 
dimensional lattice of L sites with, for now, periodic boundary conditions. The micro- 
state of the model is the particle's position u £ {1, 2, . . . ,L} and, in discrete-time, 
the probabilities to move between those positions P(aji |cji_i) are contained in the 
transition matrix P. We assume that the particle has probability p per timestep, with 
< p < 1, to move in the clockwise direction and probability q per timestep, with 
< q < 1 — p, to move in the anti-clockwise direction. Note that, if p + q < 1, 
the particle also has a finite probability to remain stationary in a given timestep. The 
position of the particle on the state space {1,2,..., L} is thus a Markov chain with 
transition matrix 



v 



o 

V 

l-p-q 




\ 



(1.42) 



i - p - q/ 



It is a simple exercise to show that this Markov chain has a limiting (stationary) 
distribution with probability 1 / L for the particle to be found on any given site - an 
intuitively obvious result! Slightly more interesting, from the nonequilibrium point 
of view, is the particle current J n (oj) which we define as the net number of clockwise 
jumps made by the particle per timestep. This is a function of the form ( 11.7b with 



(1.43) 



A straightforward calculation shows that the mean stationary current is given by 
{Jn) = p — q- We shall see in Sec. 11.4.31 that this corresponds to the concentration 
point of the probability distribution P(J n = j) which satisfies an LDP. For now, 
note that there is an obvious qualitative difference between the case of p = q (zero 
mean current) and the case of p ^ q (non-zero mean current). Mathematically, this 
difference is just the distinction between a reversible and a non-reversible Markov 
chain, in the sense of detailed balance. As explained in Sec. 11.2.31 we identify the 
former case with an equilibrium system and the latter with a nonequilibrium system. 

The continuous-time version of this random walk can be understood by associating 
a physical time increment At with each discrete timestep (where above we implicitly 



Hugo Touchette and Rosemary J. Harris: Large deviation approach to nonequilibrium systems — 
Chap. 1 — 2012/4/25 — 0:18 — page 15 



15 



assumed At = 1), setting the hopping probabilities per timestep to pAt and qAt and 
then taking the limit At — > 0. Formally, our particle then remains at a given site 
for an exponentially distributed waiting time, with mean l/(p + q), before moving 
clockwise, with probability p/(p + q), or anti-clockwise, with probability q/(p + q). 
Note, in particular, that p and q are now interpreted as rates rather than probabilities 
and can each be greater than unity. The infinitesimal generator corresponding to this 
process is 



G 



(-P - 1 

q 
o 

V p 



p 

-p- q 
q 

o 





p 

-p- q 
o 



\ 



qj 



(1.44) 



Unsurprisingly, a picture similar to the discrete-time case emerges: the process has 
a stationary state which has mean density 1 / L on each site and mean current p — q. 
Following the previous sections, we shall be interested next in deriving such mean 
values as concentration points of LDPs. To be specific, we shall illustrate the general 
discussion below with explicit calculations related to the continuous-time random 
walk model and various modifications of it. We note that, as mentioned in Sec. 11.2.21 
the appropriate scaling limit for which an LDP holds depends on the observable we 
wish to consider. 



1.4.2 

Large deviation principle for density profiles 



A general interacting particle system has a state space consisting of all possible 
particle configurations and, on a coarse-grained scale, one is often interested in the 
probability of observing a particular fixed-time density profile in space. This leads 
to the concept of a density function LDP which can be straightforwardly extended 
from equilibrium to nonequilibrium; see e.g., J3|. 

The appropriate scaling limit expressing such an LDP is the infinite volume (and 
infinite particle number) limit. To be precise, for a system defined on a lattice of 
linear size L in d dimensions, one considers taking the thermodynamic limit L — > oo 
whilst rescaling the coordinates r to x = r/L. The probability of seeing a given 
density profile p(x) is then expected to obey 

P[p(x)] « cxp {-L d ^[p(x)]} (1.45) 

as L — > oo. This is afunctional LDP, as P and T are both functionals of /o(x)o The 
use of the square brackets emphasizes this point. 

For equilibrium systems with short-range interactions, the form of the large devi- 
ation rate functional T is obtained from the knowledge of f(p), the free energy per 

3) T is the large deviation analogue of the Ginzburg-Landau free energy expressed as a function of the 
particle density in the grand-canonical ensemble. 
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site, as 



^[p(x)] = / [/(p(x)) - /(p*) - (p(x) - p*)/'(p*)] dx, (1.46) 



where p* is just the mean number of particles per site Q]. We see here that T — 
for the uniform density profile p(a-) = p* . In other words, as expected, p* is the typ- 
ical density about which the probability distribution concentrates in the large volume 
limit. 

To illustrate these results, let us consider a collection of independent random walk- 
ers in one dimension, discussed as a special case in 13]. To facilitate later generaliz- 
ations, we now choose to work with open boundaries rather than periodic boundary 
conditions. Specifically, we modify our set-up to consider L sites coupled to left and 
right boundary reservoirs with densities p j, and p^ respectively, so that particles are 
input from the left reservoir with rate pp^ and from the right reservoir with rate qpR. 
In the bulk, and for exiting the system, each particle independently has the dynamics 
of the single random walker defined in Sec. llAll above. 

For equal reservoir densities, Pl = PR = P*> it is a relatively simple exercise to 
show that for any p, q, the number of particles on each site is a Poisson distribution 
with mean p* . The corresponding free energy (see e.g., j24j]) leads to a large deviation 
functional of the form 



P[p(x)] 



dx. (1.47) 



A Taylor expansion of the integrand readily demonstrates Gaussian fluctuations about 
p(x) = p*. Note that, in this special case of equal reservoir densities, the form of 
T is the same for p = q and p / q: it is a local, convex, functional as is typically 
the case for equilibrium. Furthermore, an ensemble equivalence argument suggests 
that in the L — > oo limit, the density large deviation functional would be the same for 
periodic boundary conditions with a fixed mean density p* . 

For interacting particle systems with non-equal reservoir densities (i.e., boundary 
driving) the situation is much more interesting. In particular, the density large de- 
viation functional is generically expected to have a non-local structure reflecting the 
long-range spatial correlations characteristic of nonequilibrium. This is seen for ex- 
ample, in the case of the symmetric simple exclusion process I25LI26I1 which can be 
treated analytically by using the well-known matrix product ansatz 12711 and an associ- 
ated additivity property. In the corresponding asymmetric simple exclusion process, 
J" is non-convex for some parameters indicating a phase transition liR |231 . The 
form of T in certain models can be obtained by utilising the macroscopic fluctuation 
theory of Bertini et al. I5I I30I1 to which we shall return in Sec. 11.4. 5] 

1.4.3 

Large deviation principle for current fluctuations 



The presence of non-zero currents is a generic feature of nonequilibrium stationary 
states. For Markov processes, one generically finds that a given current (e.g., 
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the net number of particles hopping between two lattice sites) time-averaged over the 
interval [0, T] obeys a large deviation principle with speed T, i.e., 



P( Jt = j) « e 



'TI(j) 



(1.48) 



The relevant scaling limit here is the long-time, T — > oo, limit. Although this may 
be combined with an infinite volume limit (as in Sec. ll.4.5l below). there is particular 
interest in current fluctuations in small systems, e.g., trapped colloidal particles (see 
the chapter of this book by Reid et al.) or single molecule biological experiments 
(see the contribution by Alemany et al.). In this spirit, we use this section to illustrate 
the calculation and properties of IN) for a single random walker. For a treatment of 
general Markov diffusions, see Ell. 

As follows from the general discussion in Sec. 11.3.31 the rate function I(j) can be 
obtained as the Legendre-Fenchel transform of the SCGF \(k) of Jt, which, for a 
continuous-time process with finite state-space, is given by the principal eigenvalue 
((k) of a tilted generator. For example, for the single-particle random walker on a 
ring with a current Jt defined, as in Sec. 11.4.11 to be the net number of clockwise 
jumps the particle makes per unit time, then we need the principal eigenvalue of the 
matrix 



( 



G(k) = 



pe 



k 



-p-q 

— k 

qe —p — q 

qe~ k -p 





pe k 







V pe k 











qe~ k \ 





-P- <?/ 



It is easy to show that the normalized vector (1/L, l/L, . 
of this matrix with eigenvalue 

~p(l-e k )-q(l~e- k ) 



(1.49) 



. , 1 / L) is a left-eigenvector 



(1.50) 



and an appeal to Perron-Frobenius theory 13211 supports the assertion that this is the 
desired principal eigenvalue £(fc) which is equal to the SCGF A(fc). From here it is a 
straightforward, albeit tedious, exercise to calculate I(j): 



I{j) = max{kj 

k 



\(k)}=p + q 



V j 2 + ^pq 



■j\n 



j + V P + 4pg 
2p 



(1.51) 



Note that this result can also be obtained by arguing that the clockwise jumps form a 
Poisson process with rate p, whereas the anti-clockwise jumps form a Poisson process 
with rate q. Considering the long-time limit of the Poisson process, we hence have 
separate large deviation functions for the clockwise current J+ and the anti-clockwise 
current J_ with respective rate functions 



=P-J++ 3+ In — > ) = q~j-+ J- In — • 

P 1 



(1.52) 



The rate function for the net current Jt — J+ — J- can then be obtained by the 
method of contraction discussed in Sec. I L3.il 
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Notice that the rate function for the current of the random walker obeys the fluctu- 
ation relation symmetry 

H-j)-m = c3 (1-53) 
with c = ln(p/q), so that 

pn T = J \ = eTcJ - ^ 

P(Jt = -3) 

This result is a simple example of a fluctuation relation of the Gallavotti-Cohen 
type I33H34H35I1 . As mentioned in Sec. 11.3.31 this result can also be expressed as the 
SCGF property X(k) = A(— fc — c) which derives ultimately from a straightforwardly- 
verified symmetry of the tilted generator: 

G(fc) T = G(-fc- c) (1.55) 

where T denotes here transpose (not time). Note that for other currents (e.g., counting 
the jumps across just a single bond), G(k) T is no longer equal to G(— k — c) but, 
under quite general conditions, is related to it by a similarity transform so that the two 
matrices have identical eigenvalues and the symmetry [ 11.54b still holds. The diagonal 
change-of-basis matrix in the similarity transform is related to current boundary terms 
which, for finite state-space, are irrelevant in the long-time limit. The plethora of 
different finite-time fluctuation relations can be associated with different choices for 
these boundary terms; see, e.g., I23II36I1 and elsewhere in this volume. 

We shall see with an example below that, for infinite state space, the boundary terms 
may become relevant. In passing, we also note that whilst the fluctuation theorem can 
be elegantly expressed as a property of the large deviation rate function, the existence 
of a large deviation principle is not, as sometimes believed, a necessary prerequisite 
for the existence of a fluctuation relation of form i ll. 54b . A simple counter-example 
is provided by a random walker with right and left hopping rates increasing in time 
aspxi and q x t respectively. It is easy to show that such a system has no stationary 
state (the mean current increases indefinitely), but since the ratio of rates of right and 
left steps is constant, a relation of the form i ll. 54b still holds. 



1.4.4 

Interacting particle systems: features and subtleties 

Thus far, the explicit examples of this section have been concerned with single- 
particle random walks or non-interacting collections thereof. The same general 
formalism applies for interacting particle systems Ijil . although analytically tract- 
able models are the exception rather than the rule. Paradigmatic examples include 
the symmetric and asymmetric simple exclusion processes, mentioned already in 
Sec. II. 4. 2l and the zero-range process (ZRP) f3^1 . Among other results, the current 
large deviations in the open-boundary asymmetric exclusion process have recently 
been calculated Here we concentrate on the ZRP with open boundary 
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conditions l4lll (connected to so-called Jackson networks of queueing theory (U) 
in order to exemplify subtleties arising in systems with unbounded state space. 

For our purposes, it suffices to consider the ZRP on a one-dimensional open lattice, 
although related issues have also been examined for queuing models on more com- 
plicated geometries J4^]. In one dimension, each site I G {1,2, . . . , L} contains an 
integer number of particles n ; which can hop to the nearest neighbour sites according 
to a continuous-time dynamics. Specifically, in the bulk the topmost particle on each 
site hops to the right (or to the left) with rate pw n (respectively, qw n ) where w n is a 
function of the number of particles n on the departure site. Particles are injected onto 
site 1 (or L) with rate a (respectively, 5) and extracted with rate -ywn (respectively, 
Pwn)- 

The properties of the model depend crucially on the function w n . The choice w n oc 
n corresponds to non-interacting particles, such as the random walkers considered 
above, whereas other forms represent an effective on-site attraction or repulsion. In 
particular, if w n is bounded as n — > oo, i.e., 

lim w n = a < oo, (1.56) 

n— >oo 

then the model exhibits a condensation transition where, for some choices of bound- 
ary rates, particles "pile up" indefinitely at one or more sites and there is no stationary 
state. For boundary rates outside this regime, the stationary state of the model is a 
product measure characterized by a site-dependent fugacity. The mean current across 
each bond (i.e., between each pair of sites) depends on the rates p, q, a, f3, 7, S, but 
not explicitly on w n ■ However, the form of w n determines the relationship between 
the fugacity and the particle density and also the relationship between a and S and 
effective reservoir densities at the boundaries. 

To calculate the fluctuations around the stationary-state current, we can try to look, 
as above, for the principal eigenvalue of a tilted generator (which can be represented 
in terms of tensor products of matrices encoding the particle dynamics on each site). 
The form of this tilted generator will depend on the bond(s) across which we choose to 
measure the current. In the case where w n is unbounded, in the sense that w n — > 00 as 
n — > 00, then the spectrum of the tilted generator is always gapped and the Legendre 
transform of the principal eigenvalue, which can be explicitly calculated in terms of 
the transition rates, gives the large deviation rate function for all values of current. 
Furthermore, as might be expected, the principal eigenvalue, and hence the current 
fluctuations, are the same for currents across all bonds. 

On the other hand, for w n bounded, the spectrum of the tilted generator becomes 
gapless for some values of k and certain boundary terms can also diverge. Mathem- 
atically, this means that X(k) is no longer simply given by the principal eigenvalue. 
Physically, this possibility is related to the fact that, over long-but-finite timescales, 
an arbitrarily large number of particles can accumulate on each site. This manifests 
in the following properties of the current large deviation function: 

• It is bond inhomogeneous so that the probability of seeing extreme current fluctu- 
ations depends on where the current is measured; 

• It depends on the initial probability distribution of the system; 
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• It does not obey the Gallavotti-Cohen fluctuation relation for large current fluctu- 
ations. 

All of these properties are seen even in the single site zero-range process, for which 
the complete spectrum of the tilted generator and the form of X(k) for all k can be 
explicitly obtained Ij^Hl . A phase diagram in x— k space, where x is the fugacity 
characterizing the initial state, reveals that there are two types of "phase transition" in 
A(fe) analogous to first-order and continuous transitions in equilibrium. At the former, 
A(fe) has a non-differentiable point, while at the latter, it remains differentiable. An 
attentive reader may question how we can then obtain I (J), since the Gartner-Ellis 
Theorem of Sec. ll.3.H requires differentiability of \(k) for all k. In fact, the Legendre- 
Fenchel transform yields the convex envelope of which contains linear sections 
corresponding to the non-differentiable points of \(k). For the ZRP, one can argue on 
physical grounds that this is the correct form of because the most probable way 
to realise an average current j in the linear regime involves a phase separation in time 
with the system spending part of its history in a state with one average current and 
part in a state with another average current. This argument relies on the system having 
only short-range correlations in time (just as the analogous Maxwell construction in 
equilibrium requires short-range correlations in space) so it might be expected to fail, 
for example, in non-Markovian systems. 

1.4.5 

Macroscopic fluctuation theory 

Underlying the so-called macroscopic fluctuation theory is the concept of the hy- 
drodynamic limit which describes the emergence of a deterministic coarse-grained 
description from stochastic microscopic rules l45ll . Recall from Sees. 1 1 ,2l and l 1 ,3l that 
such a non-fluctuating macroscopic state, corresponding to the concentration point of 
some probability distribution, is expected to be given by the zero of a large deviation 
rate function. In this subsection, we sketch the approach of Bertini et al. SHI for 
calculating this rate function. 

We are interested here in systems with particle conservation in the bulk and a key 
ingredient is the functional form of the dependence of the instantaneous local current 
on the density. The correct scaling required so that the joint distribution of current and 
density profiles concentrates in the limit L — > oo depends on the form of this current- 
density relationship. Specifically, we focus our attention here on diffusive processes 
for which the relevant macroscopic coordinates are x = r/Landr = t/L 2 . This class 
of systems includes symmetric and weakly-asymmetric versions of both the exclusion 
process and the zero-range process, but not their asymmetric counterparts for which 
Euler scaling r — t/L is needed. To illustrate loosely the procedure involved in 
taking the hydrodynamic limit, we now return to our favourite example of random 
walkers. 

Consider a collection of non-interacting particles on a one-dimensional lattice with 
boundary reservoirs pi, and pp>, as in Sec. 1 1.4.21 and a weak asymmetry in the bulk 
hopping dynamics, viz., p = 1/2 + E/2L and q — 1/2 — E/2L. The starting point 
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for the hydrodynamic description is to consider the mean current between two neigh- 
bouring lattice sites, I and l + l, say. In terms of the site densities (mean occupation 
numbers) pi and pi+\, the mean current is 

(Jl,l+l) = PPl - QPl+l, (1-57) 
which yields a lattice continuity equation of the form 

= {Jl - ld) ~ <Jm+i) = bft-lW - m(t)] - \pPl(t) - qpi+x{t)]. (1.58) 
Now writing t = L 2 r and I = xL we have 

IJ^A = [pp(x-^,T)-qp(x,T)] - [pp(x,T)-qp(x+i,T)] . (1.59) 

Then assuming local stationarity and carrying out a Taylor expansion to second order, 
yields 



dp(x,r) _ d 
dr dx 



2 ox 



= -^-j(x,r) (1.60) 



with J(x, t) a rescaled current. 

Similarly, for general d-dimensional diffusive systems obeying Fick's Law at the 
macroscopic level, we expect 

3(x,*) = tr|p(x,t)]E-D|>(x ) t)]Vp(x,t) (1.61) 

where D[p(x, t)] is the diffusivity associated with the density profile p(x, t) and 
<r[p(x, t)] is the corresponding mobility. The hydrodynamic equation, describing the 
deterministic or macroscopic limit as L — > oo, is 

^F 1 = -V.J(x,r). (1.62) 
or 

Together, Eq. J1.6U and ( 11.62b only represent the macroscopic or typical behavior 
obtained in the hydrodynamic limit. To describe the fluctuations around this limit, let 
us now find the joint rate function of the density and current. Specializing to the case 
E = 0, one observes (see, e.g., fll) that for boundary reservoirs with equal density 
p* the fluctuations of the microscopic current across each bond can be characterized 
by 

lim <^> = (1.63) 

t— >oo t L 

At the macroscopic level, this motivates adding to J(x, r) a term representing Gaus- 
sian white noise with variance cr[p(x, t)], which leads to an LDP for the joint prob- 
ability of seeing a particular density and current profile having the form 

P[p(x,r),J(x,r)]~exp<-^/ / ^^^P^- ^} 
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(1.64) 



with J and p linked by the continuity equation i ll. 62b . 

In principle, from here one can use the tools of variational calculus to look for 
the optimal density profile p(x, r) leading to a given final density p(x). This con- 
traction leads to an implicit expression for the density rate function _F[p(x)] defined 
in Sec. 11.4.21 Finding explicit solutions is a difficult task, since in general the op- 
timal profile is time-dependent. However, successful treatments along these lines 
include the one-dimensional zero-range and Kipnis-Marchioro-Presutti models lifil . 
01; m l3(ill it is also shown that the approach is consistent with the independent res- 
ults of Derrida et al. I25LI26I1 for the symmetric simple exclusion process. 

It turns out to be easier to calculate the current large deviation function l47ll . in 
particular, if one assumes that the optimal profile leading to a particular current fluc- 
tuation is independent of time and that the optimal current is constant in space. In 
one dimension, the first assumption can be shown l48Tl to be equivalent to the ad- 
ditivity principle of Bodineau and Derrida , which is known to break down in 
systems with dynamical phase transitions; the second assumption is important for 
higher-dimensional systems lifill . Under these conditions, the rate function for the 
(rescaled) current is then given by 



For any particular model, this integral must be minimized with p(x) matched to the 
reservoir densities at the boundaries. This generically yields a current distribution 
with non-Gaussian tails even though the fluctuations themselves are locally Gaussian. 

Various scenarios for the form of the macroscopic current large deviation function 
(including those indicating dynamical phase transitions) are discussed in l47h . The 
example models treated there include the one-dimensional zero-range process with 
w n unbounded, the special case w n = n corresponding again to non-interacting 
random walkers. It has recently been pointed out that, if Eq. dl.65t holds, the optimal 
density profile is the same for all currents with the same magnitude |J| leading to 
what has been dubbed an isometric fluctuation relation l5lll . Macroscopic fluctuation 
theory has also been generalized to treat models with dissipated energy ii3l . 

1.5 

Final remarks 

In this chapter we have merely skimmed the surface of the way in which large devi- 
ation theory can provide a framework for understanding existing results in the theory 
of nonequilibrium systems and probing for new ones. We conclude here with some 
pointers to other relevant work and ideas for future research directions. 

Firstly, we note that the study of fluctuation theorems and relations, as briefly 
touched on in Sees. II. 3.31 and [L431 is a vast subject which percolates through many 
of the contributions in this volume; for overviews, see for example the chapters by 




(1.65) 
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Spinney and Ford, Gaspard, and Rondoni and Jepps. In particular, we have not dis- 
cussed here the important concept of entropy production and its distribution, which is 
central for many fluctuation relation statements and can often be expressed in a large 
deviation form. 

A second topic worth mentioning is the extension of the concept of statistical en- 
semble, discussed for equilibrium systems in Sec. 11.3.21 to nonequilibrium systems. 
Just as one distinguishes equilibrium systems with fixed energy (microcanonical en- 
semble) from systems with the energy fixed on average via a conjugate Lagrange 
parameter (canonical ensemble), one can construct a microcanonical ensemble of 
trajectories for a nonequilibrium system that is constrained to realise a particular ob- 
servable value (e.g., a particular current) or a canonical ensemble of trajectories that 
realise that constraint on average. An example of the former ensemble, obtained for 
the ASEP on a ring conditioned on enhanced flux, has recently been analyzed in l53ll . 
A study of canonical-type nonequilibrium ensembles, which are also known as biased 
ensembles, can be found in the work of Sollich and Jack In the context of large 
deviation theory, these ensembles can be understood in terms of conditional LDPs 
and the so-called Gibbs conditioning principle Q]. 

Related to the topic of nonequilibrium ensembles is the issue of determining con- 
figurations or states giving rise to fluctuations. We can already get information about 
the most probable (typical) way to realise a given current fluctuation from the eigen- 
vector corresponding to the principal eigenvalue of the tilted generator ( 11.381 1. More 
work is still needed to understand the properties and correlations of such current- 
carrying states and constrained states in general. The associated inverse problem 
of determining microscopic rates which are most likely to yield given macroscopic 
properties has been studied by Evans f55ll56ll and Monthus |H3]. 

Throughout this chapter we have assumed that non-equilibrium systems of in- 
terest are modelled by Markov processes. However, the memoryless property may 
be an inappropriate approximation for the description of many systems where long- 
range temporal correlations are known to be important; see e.g., I5H1 and references 
therein. Recent work to characterize the large deviation properties of certain classes 
of history-dependent models can be found in Ij^l and foOl . In particular, it is shown 
in l59ll that modifying a continuous-time random walker (as introduced in Sec. II. 4. ll 
so that the hopping rates at time t depend on the average current up to time t can 
lead to an altered "speed" (i.e., power of time T) in the LDP for current. Fluctuation 
relations with the right-hand side of i ll. 39b replaced by e T cm have also appeared in 
the context of anomalous dynamics; see the contribution by Klages et al. in this book. 
There is much scope for future work investigating many-particle non-Markovian pro- 
cesses and establishing a common framework for the results. In this regard, and in 
a more general way, we expect the large deviation formalism to continue playing an 
important role in quantifying nonequilibrium fluctuations in small systems. 
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